Shiwei Zhang

ReViSE: Towards Reason-Informed Video Editing in Unified Models with Self-Reflective Learning

Dec 11, 2025, via arXiv

Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance

Dec 09, 2025, via arXiv

Self-Contradiction as Self-Improvement: Mitigating the Generation-Understanding Gap in MLLMs

Jul 22, 2025, via arXiv

UniAnimate-DiT: Human Image Animation with Large-Scale Video Diffusion Transformer

Apr 15, 2025, via arXiv

Taming Consistency Distillation for Accelerated Human Image Animation

Apr 15, 2025, via arXiv

Wan: Open and Advanced Large-Scale Video Generative Models

Mar 26, 2025, via arXiv

DreamRelation: Relation-Centric Video Customization

Mar 10, 2025, via arXiv

SimpleVQA: Multimodal Factuality Evaluation for Multimodal Large Language Models

Feb 18, 2025, via arXiv

FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion

Dec 12, 2024, via arXiv

Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model

Nov 28, 2024, via arXiv